Rapid speaker adaptation by reference model interpolation

Authors

Wen Xuan Teng

Guillaume Gravier

Frédéric Bimbot

Frédéric Soufflet

Abstract

We present a novel algorithm for fast speaker adaptation that uses only small amounts of adaptation data. It is motivated by the fact that a set of representative speakers can provide a priori knowledge to guide the estimation of a new speaker in the speaker-space. The proposed algorithm enables an a posteriori selection of reference models in the speaker-space, as opposed to the a priori selection of a reference speaker-space commonly used in techniques such as Eigenvoices. We compare the proposed algorithm with common rapid adaptation techniques in the context of a phoneme recognition task. Experimental results on the IDIOLOGOS and PAIDIALOGOS corpora [1] show that the proposed algorithm achieves slightly better improvement than classic Eigenvoices in phoneme accuracy rate, especially for atypical speakers such as children.
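The abstract does not spell out the estimation procedure, so the following is only an illustrative sketch of the general idea of choosing reference models after seeing the adaptation data and then interpolating between them. It rests on several assumptions that are not from the paper: each reference speaker is reduced to a single mean vector, frames are scored with a unit-variance Gaussian, and the interpolation weights are a softmax of per-frame log-likelihoods. All function names are hypothetical.

```python
import math


def log_likelihood(frames, mean):
    """Log-likelihood (up to a constant) of frames under a unit-variance Gaussian.

    Assumption: a single Gaussian stands in for a full reference speaker model.
    """
    return -0.5 * sum(
        sum((x - m) ** 2 for x, m in zip(frame, mean)) for frame in frames
    )


def select_references(frames, reference_means, k):
    """A posteriori selection: keep the k references that best explain the data."""
    ranked = sorted(
        range(len(reference_means)),
        key=lambda i: log_likelihood(frames, reference_means[i]),
        reverse=True,
    )
    return ranked[:k]


def interpolate(frames, reference_means, k):
    """Return (chosen indices, weights, adapted mean) for the new speaker.

    Assumption: weights are a softmax of average per-frame log-likelihoods,
    a simple stand-in for whatever weight estimation the paper uses.
    """
    chosen = select_references(frames, reference_means, k)
    scores = [
        log_likelihood(frames, reference_means[i]) / len(frames) for i in chosen
    ]
    top = max(scores)  # subtract the maximum for numerical stability
    raw = [math.exp(s - top) for s in scores]
    total = sum(raw)
    weights = [r / total for r in raw]
    dim = len(reference_means[0])
    adapted = [
        sum(w * reference_means[i][d] for w, i in zip(weights, chosen))
        for d in range(dim)
    ]
    return chosen, weights, adapted
```

With three reference means and a single adaptation frame lying midway between the first two, `interpolate(frames, reference_means, k=2)` discards the distant third reference and weights the remaining two equally. The contrast with an Eigenvoices-style approach is that the subset of references is chosen per speaker from the adaptation data rather than fixed in advance.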

Similar resources

Rapid Speaker Adaptation With Speaker Clustering

This thesis addresses the problem of rapid speaker adaptation. This is the task of altering the parameters of a speaker-independent speech recognition system so as to make that system look more like a speaker-dependent system, using a very small amount (<10 seconds) of speaker-specific data. The approach to speaker adaptation taken in this work is called speaker cluster weighting (SCW). SCW extend...

Rapid adaptation using penalized-likelihood methods

In this paper, we introduce new rapid adaptation techniques that extend and improve two successful methods previously introduced, cluster weighting (CW) and MAPLR. First, we introduce a new adaptation scheme called CWB which extends the cluster weighting adaptation method by including a bias term and a reference speaker model. CWB is shown to improve the adaptation performance as compared to CW...

Improved Bayesian learning of hidden Markov models for speaker adaptation

We propose an improved maximum a posteriori (MAP) learning algorithm of continuous-density hidden Markov model (CDHMM) parameters for speaker adaptation. The algorithm is developed by sequentially combining three adaptation approaches. First, the clusters of speaker-independent HMM parameters are locally transformed through a group of transformation functions. Then, the transformed HMM paramete...

Cluster adaptive training for speech recognition

When performing speaker adaptation there are two conflicting requirements. First, the transform must be powerful enough to represent the speaker. Second, the transform must be quickly and easily estimated for any particular speaker. Recently the most popular adaptation schemes have used many parameters to adapt the models. This limits how rapidly the models may be adapted. This paper examines an ad...

Reconstructing voices within the multiple-average-voice-model framework

Personalisation of voice output communication aids (VOCAs) makes it possible to preserve the vocal identity of people suffering from speech disorders. This can be achieved by the adaptation of HMM-based speech synthesis systems using a small amount of adaptation data. When the voice has begun to deteriorate, reconstruction is still possible in the statistical domain by correcting the parameters of the mod...

Publication date: 2007
